Features Extraction of Arabic Calligraphy using extended Triangle Model for Digital Jawi Paleography Analysis
نویسندگان
چکیده
The style of writing or calligraphy applied in ancient manuscripts gives useful information to paleographers. The information helps paleographer to identify date, writer, number of writers, place of origin, and the originality of manuscripts. This information is known as features. The features from characters, tangent value, dominant background and also Grey-Level Co-occurrence Matrix (GLCM) have been used in this field of research. A novel technique was proposed for digital Jawi Paleography. Jawi is the original Malay writing based on Arabic characters. The technique proposed models triangles on images and extracts features from them. The features are used for classification. In this paper, new features for the Triangle Model are proposed. Also, the implementation of four zones is reported. The number of features has been extended from 12 to 45. For validation of proposed algorithm, 60,000:20,000 training and testing data from Hoda digit dataset has been prepared, selected randomly for 10 rounds of testing. For further verification, two Supervised Machine Learning (SML) and three Unsupervised Machine Learning (UML) algorithms were experimented. These experiments were conducted using a new Arabic calligraphy dataset that was set up from 1,225 Arabic letters taken from calligraphy books. From the data, SML experiments were conducted with the ratio of 807:408 for training and testing. Whereas for the UML, three classifiers were tested on 30 images of Arabic calligraphy dataset. Results from the tests prove that the Triangle Model technique can successfully be used in digital paleography of Jawi characters.
منابع مشابه
Calligraphy Style Correlation Discovery Based on Graph Model and Its Applications
As more and more works of calligraphy exists in digital library, traditional browsing and searching are not satisfying. This paper presents an algorithm for calligraphy style correlation discovery based on graph model. We first segment calligraphy work into characters, extract their texture features through 64 Gabor channels, and estimate the calligraphy style using a probability multi-class SV...
متن کاملDigital surface model extraction with high details using single high resolution satellite image and SRTM global DEM based on deep learning
The digital surface model (DSM) is an important product in the field of photogrammetry and remote sensing and has variety of applications in this field. Existed techniques require more than one image for DSM extraction and in this paper it is tried to investigate and analyze the probability of DSM extraction from a single satellite image. In this regard, an algorithm based on deep convolutional...
متن کاملParallelization of Rich Models for Steganalysis of Digital Images using a CUDA-based Approach
There are several different methods to make an efficient strategy for steganalysis of digital images. A very powerful method in this area is rich model consisting of a large number of diverse sub-models in both spatial and transform domain that should be utilized. However, the extraction of a various types of features from an image is so time consuming in some steps, especially for training pha...
متن کاملFeature Extraction and Efficiency Comparison Using Dimension Reduction Methods in Sentiment Analysis Context
Nowadays, users can share their ideas and opinions with widespread access to the Internet and especially social networks. On the other hand, the analysis of people's feelings and ideas can play a significant role in the decision making of organizations and producers. Hence, sentiment analysis or opinion mining is an important field in natural language processing. One of the most common ways to ...
متن کاملComprehensive Analysis of Dense Point Cloud Filtering Algorithm for Eliminating Non-Ground Features
Point cloud and LiDAR Filtering is removing non-ground features from digital surface model (DSM) and reaching the bare earth and DTM extraction. Various methods have been proposed by different researchers to distinguish between ground and non- ground in points cloud and LiDAR data. Most fully automated methods have a common disadvantage, and they are only effective for a particular type of surf...
متن کامل